Distributing Efficiently the Block-Max WAND Algorithm
نویسندگان
چکیده
Large search engines are complex systems composed by several services. Each service is composed by a set of distributed processing nodes dedicated to execute a single operation required to user queries. One of these services is in charge of computing the top-k document results for queries by means of a document ranking operation. This ranking service is a major bottleneck in efficient query processing as billions of documents has to be processed each day. To answer user queries within a fraction of a second, techniques such as the Block-Max WAND algorithm are used to avoid fully processing all documents related to a query. In this work, we propose to efficiently distributing the Block-Max WAND computation among the ranking service processing nodes. Our proposal is devised to reduce memory usage and computation cost by assuming that each one of the P ranking processing nodes provide top-K/P + α documents results, where α is an estimation parameter which is dynamically set for each query. The experimental results show that the proposed approach significantly reduces execution time compared against current approaches used in search engines. c © 2013 The Authors. Published by Elsevier B.V. Selection and/or peer-review under responsibility of the organizers of the 2013 International Conference on Computational Science.
منابع مشابه
Top-k Query Processing with Conditional Skips
This work improves the efficiency of dynamic pruning algorithms by introducing a new posting iterator that can skip large parts of the matching documents during top-k query processing. Namely, the conditional-skip iterator jumps to a target document while skipping all matching documents preceding the target that cannot belong to the final result list. We experiment with two implementations of t...
متن کاملGPU-Accelerated Block-Max Query Processing
In this paper, we propose a method for parallel top-k query processing on GPU(s). We employ a novel partitioning strategy which splits the posting lists according to document ID numbers. Individual GPU threads simultaneously perform top-k query processing within their allocated subsets of posting lists, the results of the query are merged to give the final top-k results. We further design a CPU...
متن کاملSurgeon's Magic Wand: A Screen Pointing Interactive Method
A novel, non-touch, screen pointing “magic wand” interface is proposed for surgeon’s use in an environment requiring simultaneous display of several patients’ data over a continuous period of time. The magic wand or passive pointing device does not have any active energy source within it (as opposed to a laser pointer) and thus cannot easily be detected or identified. Thus, modeling and simulat...
متن کاملComparison of Map@500 Scores (cw09-a) for Rank-s and Taily Instances. a C R O N Y M S Csi Centralized Sample Index Wand Weighted and Xii
Selective search is a modern distributed search architecture designed to reduce the computational cost of large-scale search. Selective search creates topical shards that are deliberately contentskewed, placing highly similar documents together in the same shard. During query time, rather than searching the entire corpus, a resource selection algorithm selects a subset of the topic shards likel...
متن کاملThe wand versus traditional injection for mandibular nerve block in children and adolescents: perceived pain and time of onset.
PURPOSE The purpose of this study was to compare the perception of pain and time of onset in relation to mandibular alveolar nerve block administered by a computerized anesthesia delivery system (ie, The Wand) and a traditional anesthesia system (ie, the syringe). METHODS This study was conducted according to a split-mouth design, with both types of injections being given to all patients. Sub...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013